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ABST’^AC^ 

T’his study is concarned with certain aspects of 
approximation theory which can be introduced into the mathematics 
curriculum at the secondary school level. The investigation examines 
existing literature in mathematics which relates to this subject in 
an ef-<=ort to determine what is available in the way of mathematical 
concepts pertinent to this study. As a result of the literature 
review the material collected has been arranged in a structured 
mathematical form and existing mathematical theory has been extended 
to make the material use-^ul to instructional problems in high school 
algebra. 'T’te results of the study are found in the expository 
material which comprises the maior portion of this report. This 
material contains ideas of approximation theory which relate to 
elementary algebra. Also, included is a collection of references for 
this material. The report concludes that certain techniques of 
approximating a root o-p a polynomial by finding roots of a derived 
polynomial can be presented in a manner which is suitable for courses 
in elementary algebra. C'^L) 
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SUMMARY 

It is a well-knovm mathematical principle that certain solutions 
to an algebraicj equation may be approx^ated by considering an 

derived from the original equation by "if ^ort of nrLetoe 

equation. Estimates of the errors introduced by this sort of procedure 

are triven in advanced courses in mathematics. At present, mathematical 

irstSotion at the secondary level does not f •. 

This is true oven though other areas of science instrac ion 
for the basic concepts involved in this principle. 

The fundamental question for this investigation is: Is there a 

systematic body of mathematical knowledge which covers this approximd- 
tioHeotoiqu^^^ i. it possible to present this J-owledge ^ con- 
venient form whereby a program of instruction can be established in 
courses in elementary algebra at the secondary school level. Ihe basic 
task is to answer this question. 

In order to answer this question it was first necessary to make a 
rather extensive examination of the existing literature in raaf ematios 
tf find out whi? was already available in the way of mathematical con- 
Lpts pertinent to the study, A search was made 
ture to determine if an attempt had been made to relate 
mation teolmiques to elementary algebra at the secondary sonool level. 

The search of the literature revealed some sources of mathematical 
infoiraation but no reference was found in the educational literat^e o 
the subject of this study. The mathematical concepts were classified j 
areas and those pertinent to the study were put in a convenient stiuc- 
tured mathematical foim for future use. 

One significant observation was made about the level of mathemati- 
cal concept! found in the literature. Ataost all were at an advanced 
level and those which might be pertinent to the study were found jx r - 
search journals in the period from 1900 to 1930. The . 

matioal research in recent years has not included 

oifio study. As a result of this, many of the ideas presented in tne 
expositOTy^material in Appendix A had to be developed and/or extended 

as original research. 

The criteria used to determine if the concepts be in 

elementary algebra and if these concepts were to ''f. 

pxnositorv material were the following: Is it pertinent approximation 

thLry? Is it good mathematics which is readable by the''averap 
school mathematics teacher? Is it related to high school ^ 

sense it depends upon strictly algebraic and/or 

the material presented in the expository section at a ^ 

school teacher can understand? Is it adaptable to what a hig 
student Imows about mathematics? 

The results of the study are found in the expository material in 
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Appendix A. In this material one can find the ^ 

mation theory as related to elementary algebra. It is “*'® 

basic question of this study can be answered and in 

Certain aspects of approximation theory can be presented lath sme rigor 

and/or in an intuitive manner to make an additional contribution t 

mathematics education at the secondary school level. 

has two significant aspects. One, it broadens the range °f 

concepts w^ch are available at a given experience level. Secondly, it 

provides the kind of mathematical experience which the 

can use and provides it early enough to be relevant to the first courses 

in these sciences at the secondary school level. 

A by-product of the study has been the additional insight into 
mathematics obtained by the principle 5.nvestigator during ®. . . 

the investigation. There is no quantitative way to measure this, but it 

has been significant. 

The principle recommendation which should be made as the result of 
the findings is the continuing of this work to the last and ultimate test 
of its value. Some teaching units should be prepared and be used in m 
experimental situation to detemine the teachability of the concepts out^ 
liLd in the expository material. The expository material is essentially 
background for the units and should not be used as teaching material t 
the secondary school level. It would be appropriate for prospective 
mathematics teachers. 
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I, PROBLEM UNDER CONSIDERATION 



Certain equations which arise from physical problems prove to be 
difficult to solve because of the numerical computations which have to 
be completed. Sometimes an approximation for a solution of the original 
equation can be obtained by ignoring a given term of the equation and 
solving the “reduced'* equation. For example: 



An equation such as 



( 1 ) 



sS • 



1100 



3.1 



can be transformed into the equation 



( 2 ) 



d 6. 2d 

TE ^ 



9.61 



1100 " 1 , 210,000 

A solution to (2) is approximated by solving the equation 



(3) 



d 

TE 



6. 2d 

nod 



9.61 



Equation (3) is obtained from (2) b-'” “ignoring" or deleting the term in 
d^. The solution of (3) is approximately lUl and this comes reasonably 
close to satisfying (2) and, hence, satisfying (l). 



One might be tempted to conclude that this is ^ “accident" of the 
equation involved but this is not the case. Physicists use the proce- 
dure just described quite frequently and chemists are known to use a 
similar procedure in the solution of some of their equations. Moreover, 
there appears to be a desire on the part of teachers in these sciences 
that some instruction take place in the mathematics curriculum on 
precisely these procedures. 



The fundamental question which should be answered is • 



Is there a systematic body of mathematical knowledge which covers 
such examples as just described (and consequently others) and is it 
possible to present this knowledge in some convenient form whereby a 
program of instruction can be established in courses in elementary 
algebra at the secondary level? 



The last half of the question depends upon answers to the first half of 
the question and several other factors which do not readily lend them- 
selves to research and analysis. These other factors, such as teachers, 
curriculum, etc., are not necessarily included in this study. However, 
a significant portion of the fundamental question can be answered if the 
specific objectives of investigation are: 
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l(a) to determine if any significant body of knowledge exists in approxi- 
mation theory which could be made available for secondary mathe- 
matics instruction, and 

l(b) if such information is available, to systemize this information in 
order that secondary teachers of science and mathematics might 
have a significant and structured elementary mathematical model 
to use in relating mathematics and science, 

2, to extend this information in whatever manner necessary to make the 

mathematical model more convenient for use at the secondary 
level and 

3. to present the various aspects of the findings of the first three 

objectives in an expository form which would be readable by the 
’’average” secondary mathematics teacher. 

One rather clear limiting factor imposed within the objectives just 
noted is the relevancy of the material to be investigated to the secondary 
school level. This means, in essence, that no mathematical concepts 
should, enter into the results of the investigation which do not have a 
basis within high school Algebra or Geometry. Any approximation te-^.h- 
niques which depend upon the Calculus are automatically excluded from the 
investigation. For a similar reason infinite series are also excluded 
even if there might be a partial justification for including this ap- 
proach on the grounds that one or two high school textbooks include some 
discussion of the subject. What should be clear is that the results of 
the investigation depend upon the ’’usual” (and perhaps traditional) con- 
tent of the subjects of Algebra and Geometry. This limitation has par- 
ticularly significant implications for educational applications of the 
results of this study. 

II. METHODOLOGY FOR THE INVESTIGATION 



The procedures followed in this study were essentially the same as 
those of any scholarly investigation. These procedures are given here by 
phases mth appropriate comments on each phase. 



Phase 1. An extensive examination of existing literature related to 
the general area of approximation theory was made. This was done in order 
to determine what is already known in the area and what might be appro- 
priately related to this investigation. Library resources, such as the 
Educational Index, were examined for references to the topic. Books in 
the areas of numerical techniques, engineering applications, computer 
sciences and any area which might make use of approximation techniques 
were examined for suitable materials and for references to other possible 
sources. The mathematical journals, both pure and applied, were examined 
for papers and references on approximations. Members of the mathematical 
community were consulted for sources of information and the free ex- 
change of ideas with their members has been most helpful during the in- 
vestigation. 
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Phase 2. After the material and sources of literature had been 
collected, it was necessary to examine each reference found to determine 
if it was related to the problem and if it might be useable material. 
This was clearly a judgment decision determined by the previously men- 
tioned specific objectives of the study. It was not always clear, at 
that point, that some source was necessary for future use. This was 
minimized by the establishment of certain criteria for selection of 
materials. These criteria were established before the investigation 
actually began and are enumerated here. In many instances during the 
search of the literature a source was rejected at that point because it 
obviously failed to meet these criteria. 



The criteria used adcthe following questions: 



(a) 



Is 



it pertinent? Does it really belong in approximation theory? 
(Some material was related, but quite secondary in nature.) 



(b) Is it good sound mathematics? Is it readable by the "average” high 

school mathematics teacher? (Much periodical literature in mathe- 
matics is presented so abstractly only another research mathema- 
tician can read it.) 

(c) Is it related to high school Algebra? Does it depend upon strictly 

algebraic and/or geometric concepts (deducible from these subjects 
at the high school level)? (Some methods depend upon Calculus and 
were not specifically suited for this project.) 

(d) Is this material at a level that a high school teacher can under- 

stand? (This means making a decision on what most teachers Icnow 
and what their training has been. It is clear from various 
studies that this is not as high as desired and for this study to 
have any impact or application it must produce something which 
most teachers can study and, hence, learn.) 

(e) Is it adaptable to what a high school student knows about mathe- 

matics? This follows naturally from (d). 

(f) If more than one technique is available to accomplish some approxi- 

mation procedure, which one best lends itself to teaching and 
learning theory? 

These criteria were adequate for the job intended, except the last 
one was never actually used. The reasons for this are given in the 
findings . 

Phase 3. The next step was to order the material by areas of appli- 
cation. As it turned out this was essentially done conciirrently with 
Phase 2, and frequently as early as Phase 1. For instance, to order or 
group desirable information on procedures to obtain approximations to 
roots of an algebraic polynomial into one area called for all material 
on Horner* s rule, Newton's method and others to be classed together as a 
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certain general approximation concept. In contrast, the approximation of 
a solution of a quadratic equation by solving a linear equation obtained 
from the quadratic constitutes an entirely different order of approxima- 
tion concepts. The areas of grouping were somewhat arbitrary but, gene- 
rally speaking, were determined by what the investigators found in the 

literature . 

Phase U. The next phase consisted of arranging the accumulated in- 
formation in some conveniently structured mathematical form. This was 
essentially a matter of developing the mathematical aspects of existing 
theory into a logically coherent system which can be used by teachers of 
secondary school mathematics. In some instances this was easily done due 
to the circumstcLnces but in others this was possible only as a result of 
work done during Phase 

Phase S, Existing mathematical theory was extended, wherever possi- 
ble and where needed, in a manner to make the system of Phase 1+ more com- 
plete and useful in application to instructional problems in Algebra. 

Some very large gaps were noted diiring Phase h in certain areas. While it 
was not anticipated that significant mathematical research would be needed, 
it turned out that the literature left significant gaps in the mathematics 
(which may be filled by information known or assumed by others and not 
available to this investigation). These gaps were filled by mathematics 
generated by the principal investigator . 

Some aspects of this phase began as early as Phase U and extended in- 
to Phase 6, with a great deal of this kind of activity in the last phase. 

Phase 6. The last step of the investigation was the preparation of 
an expository presentation with adequate reference to explicit mathema- 
tical development which covered the structured system developed and ex- 
panded in Phase The presentation was prepared in a form consistent 
vjii-.h the objectives of the study and special emphasis was placed upon the 
level of abstractness of the material. The expository development is 
found in Appendix A of this report. 

III. FINDINGS MID RESULTS 

Since the major portion of this investigation is the preparation of 
some expository material, it would be proper to include that material at 
this point. However, due to the structure of the expository material, it 
will be found in Appendix A, along with the collection of references for 
thD.s material. However, there are several observations which are perti- 
nent to the investigation as a whole and these are made at this time. 

A review of the literature reveals two rather significant facts. 
First, there is a sizeable body of knowledge available in various places 
about approximation theory. Nearly all of this is in advanced texts, 
journal articles and technical reports. All of these sources are 
available to mathematicians who regularly require their use and are^ 
versed in th . general area. A good deal of the recent information is 
frequently linked to numerical analysis and the computer. One also 
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finds the application sources are frequently the best sources of 
readable mathematics in this area. This suggests that most mathematics 
is so abstractly written and presented that only a relatively small seg- 
ment of the population has an opportunity to digest and use the informa- 
tion presented in such form. 

The second significant fact obtained from the review of the litera- 
ture is that there is no source on approximation theory which presents 
in an elementary fashion those aspects of approximation theory which are 
used in moderate amounts by the various scientific disciplines found in 
secondary school courses of study. This was somewhat anticipated and was 
a motivation factor of this investigation. However, it was anticipated 
that some sources, maybe obscure, would be found. This was not the case. 
In fact, the search for this kind of material continues, even as the 
final report is being prepared. 

Another related observation was the recognization that mathematics 
changes rather rapidly. Prior to 193^ a great deal of attention was de- 
voted to algebraic polynomials and their solutions. By 19li0 there are 
almost no references in periodical literature to such matters and fewer 
references are to be found since then. The general area of mathematics 
which had its central theme in courses called Theory of Equations has 
essentially disappeared or been absorbed by other areas of mathematics. 
This shift in interest by mathematicians probably contributed to the 
shortage of useable material on certain topics in approximation theory 
and forced the creation of certain theorems to fill gaps left by this 
shift. Their theorems may not be as original as some but no references 
have been found for them. The theorems are identified in the expository 
material by a D in the theorem number, such as Theorem D2. 

IV. CONCLUSIONS AND RECOMMDATIONS 

The conclusions of the investigation are to be found in the exposi- 
tory material as this was to be the major task of the study. As has been 
previously indicated the material is not as extensive as had been antici- 
pated and there might be an aspect which could have been included, but 
the expository material is representative and respectable mathematics. 

There are some recommendations which should be made as a result of 
this study and all are within the capability of educational endeavors. 
First, and not necessarily dependent upon this study but obviously 
recognized in carrying out the investigation, is the need for expository 
materials in mathematics which translate some phases of the subject into 
a form which high school teachers can use. This project has attempted 
to do this for an area of mathematics (applied in nature) and there are 
many and perhaps more important areas which need the same treatment. 

For instance, the social sciences are becoming more mathematical as 
these areas develop the t'cience aspect of their endeavors. The present 
high school mathematics is almost totally mathematical or physical 
science orientated. If the other areas are important to our society, 
there should be some literature which describes these relationships and 
which suggests how it might be adapted to existing patterns of curricu- 
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lum and instruction. Examples of problems from economics, psychology, and 
other disciplines are appropriately a part of mathematics instruction and 
expository material would seem to be the best avenue to get these ideas 
into the hands of teachers. 

Second, a natural consequence of this study would be the preparation 
of suitable material for lessons for high school students to use. This 
is essentially the adaption of this report to a practical situation. 

This set of lessons should be tested in a classroom situation to deter- 
mine the success of the whole idea. Mathematically, there is no reason 
why it cannot be done. Perhaps there are non-mathematical reasons why 
approximation theory of any sort has no place in secondary school mathe- 
matics, but none are now knovm. 

Finally, the investigation revealed the lack of material at any 
level on the general area of **theory of equations.” It would seem that 
in our rush into ^modern mathematics” we may have left out significant 
educational concepts which ought to be restored. It is unlikely that the 
college and university mathematics courses will include these ideas and^ 
perhaps a major portion of it can go into high school mathematics. It is 
mathematically feasible and has already been done on an experimental 
basis . 
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I. Introduction . 

The general area of approximation theory is quite broad, too broad 
to be adequately covered in a publication of this size. However, there 
are aspects of this general theory which are quite pertinent to secondary 
school mathematics and on more than one occasion (for example, see Bowen 
[U3) science teachers have suggested that some portions of this material 
might be quite appropriate for the curriculum in mathematics and at an 
early stage in the secondary schools. The question of what portions are 
available and appropriate is essentially an unanswered question as far as 
the literature is concerned. Moreover, if there are portions which are 
appropriate, can these portions be organized in a manner which makes good 
mathematical sense and which is strictly dependent upon a bare minimum of 
secondary school mathematics? 

The purpose of this material is to provide some affirmative answers 
to these questions and to present some aspects of approximation theory 
which might be appropriate for secondary school mathematics. This we 
shall do in the following sections. We begin by examining some history 
of mathematics and some types of approximation. Following this we will 
present the mathematics of one of these types by citing pertinent theo- 
rems and facts. Most of the theorems are already in the literature some- 
place and they only need to be assembled into a meaningful presentation. 
There are included certain theorems which were not found but were needed 
to fill gaps in the area. 

One very important principle was kept in mind in the presentation 
of this material. It should be material which the ’’average” high school 
teacher can use with relative ease and it should be based upon elementary 
enough mathematics so that it could be adapted to a classroom situation. 
The latter restriction essentially means a year of algebra and maybe a 
reasonably good acquaintance with graphing. 

II. Historical Background on Solving Equations . 

In this section we will give a very brief review of the historical 
development of the solving of equations and in particular, the solving of 
the quadratic and cubic equations. The purpose in reviewing the back- 
ground of these equations is to recognize that approximations played an 
important part in the early history of solving of equations, and to recog 
nize that this process was a long, evolutionary one. We shall not be 
complete in our presentation, but will touch upon only the significant 
highlights of that history. This will be accomplished by first treating 
the quadratic, then the cubic and finally all other degrees of algebraic 
polynomials . 

1. The Quadratic Equation. The earliest solutions of problems involving 
equations were obtained by trial. At least no written record has been 
found to indicate otherwise. By 1^00 B.C. the Egyptians (as described in 
in the Ahmes Papyrus) were able to make a trial guess and adjust the re- 
sults of the trial guess in a manner which gave the correct solution of 



the equation under consideration. This technique was called the "Rulfs 
of False Position" and the procedure was used by subsequent students of 
mathematics as late as the 1700' s. 

A similar technique called the "Rule of Double False Position" was 
used rather extensively for many centuries. This technique, as well as 
the other one, were subsequently replaced by our modem techniques but 
only after the algebraic symbolism was developed by Vieta and others. 

A study of the history of early mathematics (before l600) reveals 
that most solving of algebraic type equations was done by trial and 
error with the initial step being to make a guess or approximation of the 
solution and make subsequent adjustments on the basis of the first ap- 
proximation. This procedure was applied to all degrees of polynomial 
equations which arose from the problems to be solved. 

In the Berlin Papyrus (c. 2160-1700 B.C.) one finds the first known 
solution of a quadratic equation. The problem reduces to solving the 
equations 



x2 y2 ■ 100 and y " J ^ • 

The solution technique reduces to a simple case of the "Rule of False 
Position." 

The Greeks were able to systematically solve quadratic equations by 
geometric means. Euclid (c. 300 B.C.) gave solutions and procedures to 
solve problems such as 

xy = k and x - y * a 



and a(a - x) = x^. 

The Hindus may have been able to solve quadratic equations by 
^00 B.C. although no record of the method of solution has been found. 

By 500 A.D. we do find a rule, relating to the sum of geometric series, 
which shows that the solution of the quadratic equation was known, but 
we have no rule for the solution of the equation itself. About 628 A.D. 
Brahmagupta gave a definite rule for solving a quadratic. Smith [19] 
gives the steps of solution for an example of a quadratic (x2 - IQx ■ -9) 
which turns out to be, in modem terminology, 

X . ^ - ■ -9. 

One might note the similarity of the solution to the quadratic formula 
except he found only one solution, i.e., the positive one. This was 
quite typical of all early attempts to solve equations in that only posi- 
solutions were acceptable. 
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Mahavira (c. 8^0) had a way of solving quadratics for a positive 
root, but did not write it out. However, an analysis of his solution 
leads one to believe he knew substantially the modem rule for finding a 
positive root of a quadratic. 

The Hindu Rule was first given about 102^ and was widely used in 
India. 

Al-Khowarismi (c. 82^) gave two general methods of solving quadra- 
tics of the form 



+ px ■ q . 

Negative roots were neglected. Omar Khayyam (c. 1100) also gave a rule 
for solving the equation 

x^ px “ q . 

The techniques of solution mentioned above are typical of ways of 
solving quadratics until about l600. Solutions for other higher degree 
equations were essentially based upon an approximation and a subsequent 
adjustment, as in "Rule of False Position". 

In 1631 Harriot gave the first important presentation of the solu- 
tion of a quadratic (and other equations) by factoring. 

Vieta (c. I 6 OO) replaced the geometric method of solving quadratics 
by an analytic method. The solution of the equation x^ + ax b ■ 0 was 
given as 



X ~ ^ a i ~ Rb . 

The symbolism introduced by Vieta and others of his time made possi- 
ble such a representation and a closeness to the complete solution of any 
quadratic. It is easy to see that the trial and error stage is being re- 
placed by a systematic and complete method of solving a quadratic. 

2. The Cubic Equation. The earliest known (c. 3^0 B.C.) cubic equation 
was of the form x3 = h although some Babylonian tablets give tables of 
cubes about two thousand years earlier. The problem of the duplication 
of the cube (said to have been knoxm by Hipprocrates (c. R 60 B.C.)) de- 
pends upon the finding of two mean proportionals between two given lines. 
This means to find x and y in the equations 

£ - i . 

X y b • 

Archimedes referred to a problem of cutting a sphere by a plane so that 
the two segments shall have a given ratio. The problem reduces to the 
proportion 
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C - X 



and this produces the cubic x3 + c2b - cx^. Eutorius (c. ^60) tells how 
Archimedes solved the problem by finding the intersection of two conics. 

Diophantus apparently solved the equation x 3 +x®l 4 X +U> possibly by 

noting that x(x 2 + l) *■ U(x 2 + l). 



Several Arabic mathematicians, notably Almahani (c. 860 ), Tabit ibn 
Qorra (c. 87 O), Abu Ja far al-Khayin (c. 960) and Alhazen (c. 1000) dis- 
cussed the cubic and at least one found solutions to certain equations 
in a manner similar to Archimedes. 



Omar Khayyam (c. IlOO) specif iced thirteen forms of the cubic that 
had positive roots. This is a distinct advance in general theory but a 
long way from complete solutions for any cubic. However, he did expand 
the number of cubics of certain forms which could be solved by the 
"intersection of conics" techniques. Most Arab writers believed that 
the cubic equation, in general, was impossible to solve. 



From about 1200 to 1^00 several writers in western civilization 
mentioned the cubic but were unable to solve all cubics, although several 
special cubics were solved by special techniques, such as factoring. 



The real interest in the cubic lies in the work of Cardan and 
Tartaglia. History does not clearly describe who should receive the 
credit and the stories surrounding the Cardan-Tartaglia controversy make 
interesting reading. It is clear that the solution for the cubic 
x 3 + ax 2 - c was obtained by 1 ^ 3 ^ and very shortly after that a method 

of solution for the cubic x3 + bx ** c was found. Cardan in his Ars Magna 
(lE>li^) showed how to transform the first form into the second and suc- 
ceeded in transforming the general cubic x3+ax +bx+c "0 into a re- 
duced equation type x3 + px - q. This he already knew how to solve ^d, 
therefore, every cubic was solvable. Cardan's examples showed positive 
and negative roots, the latter being included for the first time. 



Vieta ( 161 ^) subsequently found the transformation which reduced 
the general cubic to the form y3 + 3by = 2c and, by a second transfoma- 
tion or substitution, arrived at a form 2 P + 2 cz-^ = b and the latter 
was solved as a quadratic in z3. 



Again history reveals the struggle to solve an equation ^d the 
subsequent success some two thousand years after the known existence of 
the equation. The cubic equation can be solved although some of the ^ 
solutions involve complicated radicals and these are frequently approxi- 
mated for ease of computation. The application of a cubic to a physical 
problem almost necessitates the approximation in order to have a suita- 
ble ansi^er to use in a physical sense. 
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3 . The Quartic Equation. One should probably suspect that any conside- 
ration of a quartic equation would be minor during the time the cubic was 
unsolved. History reveals this to be the case# One can find only slight 
mention of the Quartic prior to the time of Cardan. The first serious 
consideration of the Quartic came in l5U0, almost immediately on the 
heels of the solution of the cubic. 
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A young Italian student by the name of Ferrari solved the problem 
by reducing the equation to an equation involving a cubic. Cardan did a 
great deal to clarify the process and spread the knowledge of the method 
in his book Ars Magna . 



Not all solutions were necessarily included as negative ones were 
frequently ignored. Vieta (c. 1^90) and Descartes (l637) ^proved the 
system of solution and the modem form is due to Simpson (17h5)« 

Ij., The Quintic Equation. Euler found a method different from Ferrari' s 
for reducing the solution of the general quartic equation to that of a 
cubic equation. He attempted to apply this technique to the Quintic in 
hopes of reducing the solution of it to the solution of the quartic and 
thereby solve the quintic. However, he failed as did 
the world of mathematics. These failures prompted Ruff ini (1803, 1805 ; 
to try and show that the quintic could not be solved by such means. 

Abel and Falois eventually resolved the question. Galois ( 181*6) 
essentially answered the total question in his posthumously published 
v 7 orks . Abel had earlier shown (182I|.) that the roots of the general 
quintic cannot be expressed in terms of its coefficients by means of^ 
radicals. This we are able to do for the quadratic, cubic and quartic. 
Intuitively it would seem possible to be able to do the same for the 
quintic, but it is not. 



5 . Solutions of other equations. The question of higher degree equations 
does not seem to have interested mathematicians of western civilization 
\mtil quite late. The examination of the equations of higher order takes 
two major directions. The first is to find out from the equation all you 
can about the roots without actually solving for them. (We often do this 
for the quadratic by looking at the value of its discriminant.) Cardan 
may have known about the Rule of Signs which, tells how many positive 
roots were possible. Harriot (l62l) may have made the formulation of the 
Rule of < Signs and Descartes (1637) certainly gave such a Rule. 



The second direction for the solution of equation of higher order 
was to approximate solutions and by some process of iteration subse- 
quently obtain better approximations • The Chinese scholars of the 13th 
and Ihth centuries were outstanding in this area and this seems to be 
China's particular contribution to mathematics. In 121*7 Ch in Kiu-shao's 
writing ref lect a high degree of perfection in this aspect of equation 
solving and equivalent form of Homer's method (l8l9) is given. 



Fibonacci (1225) attempted some improvements as did Vieta (I6OO). 
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Newton (l669) simplified Vieta’s work and perfected an approximation 
technique. Horner (l8l9) further simplified Newton’s work. 

It is interesting to note that Newton’s method was replaced by 
Horner’s method (as can be observed in any of the Theory of Equations 
books) because of the ease of hand computation. However, with the ad- 
vent of computers, some of the earlier methods are better ones to use 
because they adapt easier to machine language. Progress does not always 
make old things obsolete. 

6. The Fundamental Theorem of Algebra. Perhaps the most interesting 
problem related to the solution of equations was the apparent disregard 
of negative solutions long after techniques were available to find them. 
It is not too difficult to understand the ignoring of complex solutions, 
as concepts of complex numbers had not been fully developed. However, 
there came a time (c. 1608) when mathematicians began to assert that an 
equation of nth degree has exactly n roots. It was restated at various 
times without proof and Gauss (1799) gave the first rigorous demonstra- 
tion of the theorem. 

The theorem, together with the techniques of the solving of linear, 
quadratic, cubic and quartic, completes the major theory of equation 
solving. For the quintic and others of higher order, the theory is not 
quite as complete, but complete as mathematically possible. 

7. Review and Summary of Historical Background. We have just sketched 
the basic highlights of the solution of certain types of equations. It 
should be noted that all early attempts at any particular type of equa- 
tion were essentially based upon approximations (and, in some instances, 
adjustments on these approximations). After rather lengthy periods of 
time, solutions were found for just one, then another and then another 
type of equation. Only after the symbolism became fully developed does 
one find significant contributions in this area. 

For those equations which no techniques of solution were found, 
mathematicians seemed to be able to approximate roots of equations with 
some degree of accuracy. Subsequently, certain techniques were deve- 
loped to improve a given approximation, usually by an iteration process. 

In recent times the users of mathematics, i.e., physicists, 
chemists, etc., have begun to solve certain equations by deleting some 
term of the equation and solving the resulting equation. This is fre- 
quently done even when formulas are avai3.able to solve the starting 
equation. This is essentially an approximating technique and tends to 
indicate a cyclic treatment of the subject as far as history is con- 
cerned. It also suggests that a second stage should follow at some fu- 
ture time and a general theory be founded to justify the procedures and 
approximations now being used. In what follows we partially provide an 
indication of what this second stage should contain. 
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III . Types of Approximations , 



When one begins to examine approximation theory it becomes clear that 
there are different types of approximations which may arise from or be 
related to different circumstances. In this section we will examine a 
few different types which will serve to illustrate different aspects of 
approximation theory. Some are elementary and have already been seen even 
by the beginning high school student. 



1. Approximating a number by another number. In the elementary 
school arithmetic one finds the establishment of a decimal representation 
of the rational (and sometimes irrational) numbers by giving decimal 
equivalents to certain fractions. Thus 



1 



1 

H 



1 

E 



.2$ 

. 12 ^ 



The list is usually limited to the more common (in usage) fractions, 
However, we may see statements like this: 



- .333 



j = .666 



or, in sane instances. 



22 



TT rj or IT “ 3 • 1^ • 



As every good student of mathematics knows, these last statements are not 
actually correct. The fraction 1 is not .333> and it is neither £2 nor 3.1U* 

5 7 



What should be properly written is a statement that ^ is approximately 



22 



.333 or that n is approximately or 3.1U* This is sometimes written 

with a variety of symbols for the equality but a /w or « distinguishes 
these examples from the first ones. Thus we may write 



i « .333 , 



22 



and IT « ~ or TT « 3.1U . 
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What we really mean in these instances is that ,333 is a decimal 



whose value is reasonably close to the value of ^ and that 3«1U is 



reasonably close to value of tt. This is usually done in computational 
situations either because we do not know the exact value of some number 
(tt for instance) or because we are willing to settle for an answer 
reasonably close to some number. For example the circumference or area 
of a circle with radius of 10 can be precisely given by the formula 



2TTr 



and A = rrr^ 



which in this case become 



20tt 



and A * lOOrr • 



If, because the problem depends upon physical data or because we want to 
know approximately what C and A are^we may use an approximation for tt 
and obtain 



C « 20(3.1U) or C « 62.80 



and A « 100(3.11;) or ^ ^ 3114 



Thus 62.80 and 31lt become approximations for the exact values 20tt and 
IOOtt. 



When public officials say l60 million Americans watched the recent 
walk on the moon they are approximating (actually an estimate or 6J^®ss) 
the actual number who did watch. When an engineering student counts the 
number of steps between two points and multiplies this by 3 he determines 
approximately the number of feet between the two points. These are but 
two instances in which the actual number involved is not known but is 
approximated by a number, the closeness of the approximation being deter- 
mined by the actual amount of information exactly known. Each step of 
the engineering student is not exactly three feet in length and the com- 
puted distance varies as the length of the step. Sometimes the exact 
value of a number could be stated, but we are satisfied with something 
which approximates it. It may have rained .U75 of an inch but most 
people would probably say it rained one-half of an inch, being satisfied 
with the message one-half conveyed in the conversation. 



In all these instances we see an actual value of a number being re- 
placed by a number which is approximately correct. For the fraction 



i we might use .3 or .33 or .3333, depending on precisely how close we 



wish, to approximate j . 

2. Approximating a root of an equation. The concept of finding a 
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number which approximates another number is also found in other circum- 
stances. If the second number is the root of an equation, i.e., a number 
which satisfies some mathematical expression, it is frequently possible 
to use the equation or parts of it to determine an approximation for one 
of its roots. This is similar to finding an approximation for, say 

i, but the significant difference is that we have an equation and the 



equation becomes the source of approximations whereas one must guess or 

develop a source of approximations for i . There is no mathematical 

equation which would determine how many people watched the walk on the 
moon. In the case of the equation, we have a basis for making an approxi 
mation. In section II we observed how early attempts to solve equations 
began with guesses and the guess was refined by making use of the equa- 
tion, In the case of the Rule of False Position a refinement produced 
the correct number and not just an approximation. The equation itself 
was used to determine what the second approximation should be. In the 
case of Homer’s Method (which we will describe shortly) the first ap- 
proximation for the root of an equation, together with the equation, pro- 
duce a second and better approximation. There are several techniques 
which are similar and the computer makes it possible to repeat the pro- 
cess as long as one wishes and one can refine the sequence of approxima- 
tions to obtain as close an approximation as one wishes. 



We give a description of Horner’s Method as a device to approximate 
a root of algebraic equation’'^. First, a real root of the algebraic equa- 
tion P(x) = s-qX?^ + a-j^x^"^ + a 2 X?^"^ + ... + ® 0 must be isolated, that 

is, it must be determined that a root lies between a and b, a < b. 

This can be done by several techniques, but the most elementary way is to 
find (guess at) two values a and b such that P(a) is opposite in sign 
from P(b) . If this be true then there exists c , where a < c < b such 
that P(c) = 0. Hence, c is a root of P(x) * 0. 

Second, the equation 



(1) P(x) - a^x?^ + a-j^x^“^ + ... + aji ■ 0 

is now transformed by replacing x by x - a, and this diminishes the 
roots by a . We now have a new equation. 

(2) P^(x) * ^1^"^ + ... + bji ■ 0 

with a root of this equation between 0 and (b - a) . We now determine two 
more values c and d such that 0<c<d<(b-a) and Pj^(c) is oppo- 
site in sign from P-j^(d). Now repeat the transformation again. Each new 



’'^We assume for all discussions that the algebraic equations are poly- 
nomial equations with real coefficients. 
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step of the sequence, 0 < c < x < d < (b - a) •» 1 produces numbers c 
and d which are closer to the root of P^(x) « 0 than previous approxi- 
mations, and, if b - a •• 1 in the beginning, the approximations get 
better ranch quicker in this process. Fortunately, this process can be 
done by using synthetic division and the computational work is con- 
siderably reduced by this procedure. 

Newton's Method (1676) is a similar procedure and it has an advan- 
tage over Homer's Method in that it can also be applied to trigono-^ 
metric, or logarithmic, or other simple functions. Homer's method is 
strictly reserved for algebraic polynomials with real coefficients. 

The technique of successive approximations where each new approxi- 
mation is a refinement of the preceding approximation is an excellent 
technique to apply to the task of finding a root of an equation. Ber- 
noulli's method of approximating the largest root of the equation 



x3 " ax2 + bx + c 

with real coefficients is one such example. The method uses a set of 
recursion formula such that successive values for a variable Aj^(a^b,c) 
when compared with the preceding value An»i(a,b,c) gave an approxima- 
tion for the absolute value of the largest root of the cubic, if that 
root were real. 

T. A. Pierce [13] gave a recursive technique for finding the least 
root of a cubic by developing Bernoulli-type formulas for approximating 
the absolute value of that root, if it is real. The formulas are strict 
ly algebraic in nature, but rather complicated to deduce. Moreover, 
other techniques seem to be more elementary and cover more types of 
equations than just the cubics. 

Other techniques employing the method of successive approximations 
can be found which use the equation and various derivatives of the poly- 
nomial function. These techniques employ mathematics beyond the secon- 
dary school level and, therefore, are not appropriate for this investi- 
gation. As an example of this type involving the derivative, see Ford 

C7]. 



Other techniques were developed in the twenties and thirties for 
solving certain polynomials. For example, Kennedy [9] showed an alge- 
braic treatment for polynomial equations of a certain kind via a loga- 
rithmic process to find approximations for the roots of those certain 
polynomials. Ptunning fill] gave techniques for finding real roots of 
cubics and quartics from graphs of certain straight lines which depended 
upon the discriminant of the equation. Grant gave still more infor- 
mation on how to solve quartic equations by graphical means . 

This brief summary of some pertinent papers on roots of polynomials 
shows the search for various techniques to simplify or avoid laborious 
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calculations necessary in finding approximations to roots of an equation 
by methods such as Horner* s Method, An entire period of mathematical 
work has been devoted to this kind of research. In some ways^ the compu*» 
ter has made some of this work obsolete^ while making other aspects of it 
even more valuable. 

3, Approximating a function by another function. One of the most 
widely used approximation procedures in analysis and applied mathematics 
is the procedure of approximating some function by another function. 

This technique is particularly useful if the original function is compli- 
cated and the approximating function is simpler. We are not mathemati- 
cally able to find the values of certain functions, but we are able to 
use these functions because we can approximate them by ones which we can 
evaluate . 

A classic example of this sort of a procedure is given by a very im- 
portant theorem from analysis. The first Weierstrass Approximation Theo- 
rem [ll asserts that 

if f(x) is a function which is continuous in the 
finite interval [a,b], then for every e > 0 there exists 
a polynomial of degree n ■ n(e) such that the in- 

equality 



|f(x) - Pn(x)| ^ 0 

holds throughout the interval [a,b]. 

The theorem is really very powerful for two reasons. First, any 
continuous functions on a closed interval can be approximated by a poly- 
nomial of degree n , where the degree n depends only upon how close 
one wishes to approximate the function. When we say any continuous func- 
tion, this includes an extremely large collection of functions which are 
not polynomials. It would include, except for certain instances or for^ 
certain levels of definition, the logarithmic, trigonometric, hyperbolic 
and mary others. 

The function f(x) = on [0,.9] can be approximated by a polynomial in 

The functions which can be approximated seem endless, except they are all 
continuous on [a,b]. 

Second, these functions are approximated by a single kind of func- 
tion, namely the polynomial. The polynomial function is also a con- 
tinuous function and, perhaps is the most used class of continuous fimc- 
tions. It is clearly simple, and it is the first kind of function dis- 
cussed in secondary school mathematics. It is a well-defined class of 
functions \\dth many facts already established and well-known. There is 
no general class of continuous functions which is simpler than the poly- 
nomial class, and at the same time, has so much known about it. 

Another theorem which illustrates this type of approximation is 
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Weierstrass’s Second Theorem [l] and it says 

If F(t) is a continuous function of period 2tr, then 
for every e > 0 there exists a trigonometric sum 

n 

Sn(t) - ao + I (aj^ cos kt + sin kt) 
k"!l. 

where n * n(c), such that 

|F(t) - Sn(t)| « e 



for all t • 

This theorem is not as general as the first theorem but illustrates 
a similar kind of procedure. Here periodic functions are approximated 
by a function of sines and cosines. The two observations about the first 
Weierstrass Approximation Theorem would also seem to be correspondingly 
appropriate for the Second Weierstrass Approximation Theorem. 

Both theorems are existence theorems in the sense that an nth degree 
polynomial exists which approximate an f(x) on [a,b3. It may be very 
difficult to actually find such a polynomial. However, it turns out that 
Bernstein [3] provided a subclass of polynomials which would do the job. 
The Bernstein theorem states that 



If f(x) is continuous on its Bernstein 

polynomials Bn, where 



n 




converge uniformly to it on as n *4 •. 

Thus we have a specific collection of polynomials which are known and 
which can be used to approximate any continuous function on [0,1]. 

h. Approximating Roots of One Function by Roots of Another Related 
Function. The last type of approximation technique to be mentioned is 
not obvious and, as we shall see later, has its difficulties and 
limitations. We shall describe the technique by using the example men- 
tioned in section I. SuppoL-^ we drop a stone into a well and measure 
the time from releasing the stone until the sound of the stone hitting 
the water reaches our ears. Suppose this time is 3«1 seconds. Then the 
time for the stone to fall can be written as tj^ "* time for 

the sound to return to the ear as to ■ where 1100 ft. per sec. is 

iiuu 



l/h© assumsd vslociiiy of sound in air t Thus "bho totiaX ■tim© ©lapsed is 
U) + tg " * jjIoS “ 3.1 

If we want to know the depth of the well, we must solve this equation for 
d . To do so, we use the technique of squaring both sides to eliminate 
tho square root and we obtain 

d ^ 6»2d d ^ Q 

(2) 1100 " 17210,000 * 



This is a quadratic in d and can be solved by the qua^atic for- 
mula but the computations involved are going to be lengthy. So we look 
for a way out. Wc observe that the well could not be very deep because 
the two times, t^^ and t 2 have to be small and, therefore, d/1100 is rela- 
tively small. As a consequence d2/l,210,000 would be very small relative 
to the other terms of (2). If we ignore this term, or consider only 



(3) 



d 



6.2d 
1100 “ 



9.61 



we find a new equation which can be solved for d . Incidentally, the 
solution of (3) for d certainly requires less computational work than 

solving ( 2 ) for d • 

Solving (3), we find d « lUl. Checking this in (2) one can verify 
that it approximately satisfies (2) and therefore, can be verified as 
satisfying (l). So a solution to our original problem has been obtained. 

The basic technique used in the solving of this problem is an ap- 
proxinficLtion* We found equation (2) which needed “bo be solved. e o 
tained equation (3) from equation (2) by deleting a certain term. The 
solution of equation (3) is an approximation for a solution of equation 
(2). Thus vje are actually approximating the roots of one function by the 
roots of another related function. The related function must in some way 
depend upon the original function, or in the case of poljTiomial equations 
depend upon the coefficients of the polynomial. 

It would appear, at first glance, that this technique might be ques- 
tionable. In fact, it is not above a blemish or ti-Jo, but it works in a 
surprisingly large number of cases. We shall explore this kind of an 
approximation in some depth in later sections. It is this procedure ^ 
which the science teachers would like to see developed in our courses in 

algebra . 

5. Comments. If one looks at these four types of approximations 
one observes that they all involve finding one number which approximates 
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another number. The basic difference is where does one get the informa- 
tion to make an approximation. In the last three types described there 
is given something about a desired number, i.e., root of an equation, 
etc., and the equation has been the basic source of information for the 
approximations. In sections 2 and U we did quite different things with 
the same information, thus providing different approximation procedures. 

IV. Some Theorems Related to Approximations . 

1. Introduction. In this section we shall cite some theorems which 
have some bearing upon roots of polynomial equations. These are not to 
be considered all that could be cited here, but are representative of 
those already available. A few will be essential for later sections and 
many illustrate related kinds of information. Some are too advanced to 
be considered at the secondary level, but those which can be proven by 
elementary algebraic means are proven. The proofs are included merely 

to illustrate their adaptability to secondary school mathematics. 

2. Locations of Roots of P(x) « 0. A short reference to the loca- 
tion of roots of a polynomial equation was given in the historical back- 
ground. We would like to expand on this here by citing several different 
types of locating theorems and concepts. At least one is elementary 
enough to be intuitively justified at the secondary level. In all the 
examples selected one determines something about the roots by examining 
the coefficients of the polynomial or by examining certain computations 
entirely dependent upon these coefficients. 

Mathematically, we are justified in doing this because the coeffi- 
cients of any polynomial equation can be expressed in terms of the roots 
of the equation. For example, a quadratic equation with roots a and 
b can be written as 



(x - a)(x - b) « 0 



Multiplying and collecting terms we have 

x^ - (a + b)x + ab « 0 . 



Thus any quadratic of the form x^ - px + q * 0 has its coefficients 
uniquely determined by the roots of that quadratic. All quadratic equa- 
tions may be transformed to this form by dividing by the coefficient of 

x^, if it is not already 1. 



A similar argument for a cubic equation produces a form 







(a + b + c)x^ + 



(ab + ac + bc)x - abc 



0 



where a, b and c are the roots of the cubic. Likewise the nth de- 
gree polynomial equation has as its corresponding form 



x^ — ^^1 ^2 ... ^ ... A a^ • 

2 ^ 



a2 • a3 










where the ± signs depend upon n being odd or even. 

Thus, every coefficient of any polynomial P(x) = 0 is a function of 
the roots of that equation and the coefficients should determine some** 
thing about the location of the roots of P(x) » 0. In what follows the 
equation P(x) « 0 shall be understood to be 

(l) P(x) = + a^x*^"^ + agX^"^ + ... + ayj = 0 

unless otherwise stated with a^_ being real numbers. 

We take as a first example of location of roots the Routh-Hurwitz 
criterion. The Routh-Hurwitz criterion provides an algebraic manlpula« 
tion of the coefficients of a polynomial to find out if the roots are to 
the right or left of the y-axis. Given the polynomial equated to zero 





P(x) «= x^ + 
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where the coefficients with indexes larger than n or with negative in-" 
dexes are replaced by zeros. 

A necessary and sufficient condition that there be no zeros of P(x) 

= 0 to the right of the y-axis is that all the Dj^’s be positive a For 
each determinant with a negative sign there is a root to the right of the 
y-axis. Further, if D]q is the only one which is negative, there is a 
single root to the right of the y-axis, and this root must lie on the 
x-axis. If is zero, the single root is at the origin. It is clear 

from this statement, which is given without proof that one can determine 
something about the character of the roots of P(x) = 0 without being able 
to actually find the roots. 

Van Vleck [l5] proved a theorem which further aids in the location 
of the roots of P(x) = 0 if the theorem is applied along with the Routh- 
Hurwitz criterion. The theorem is as follows: 
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If is real and the terms of the sequence 
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are positive, all the roots of the equation 

Co + CiX + CgX^ + . . . + ® 

are imaginary, and all but one of the roots of the equation 



are imaginary. 

Descartes' Rule of Sign is a similar kind of criterion for deter- 
mining the character of the roots of P(x) ■ 0. The rule states that the 
number of positive real roots of P(x) * 0 is equal to the number of 
changes of sign in the coefficients (when taken from down to x°) di- 
minished by 2k, where k is an integer with minimum value of zero. The 
rule simply describes the maximum possible number of real positive roots 
and not actually how many exists. Descartes' Rule of Sign applied to 
p(-x) * 0 gives the maximum possible number of real positive roots of 
P(-x) “ 0 and hence, the maximum possible number of real negative roots 

of P(x) 0. 

These criteria are helpful in determining some inf onnati on about 
P(x) “ 0 but in no way aid in solving P(x) « 0 or in approximating any 

roots of P(x) * 0. 

Consider the example 

(2) x3 + 6x2 + iix +6*0, 

the Hurwitz determinants may be computed and they give 



and the Rout h-Hur wit z criterion applied to tM.s case reveals no roots to 
the right of the y-axis. 

The Descartes Rule of Signs applied to this example indicates no 
changes in sign so there are zero positive real roots. If we replace x 
by -X in the exaraple, we obtain 

(3) -x^ + 6x^ - llx + 6 « 0 

and the Descartes Rule of Signs shows 3 sign changes so equation ( 3 ) has 
at most 3 positive roots or possibly only 1. Hence, the original equa- 
tion (2) has 3 or 1 negative roots. The Routh-Hurwitz criterion applied 
to (3) indicates 3 roots to the right of the y-axis. Hence, (2) has 3 
real negative roots. The Descartes Rule of Signs gave 3 as a maximum num- 
ber but we could not know that 3 was actually the right number. In this 
sense, the Routh-Hurwitz criterion is better than the Descartes Rule of 
Signs, but it is also more complicated to apply (in the computational 
sense) . 

A second, (and frequently useful ) piece of information about the 
character of the roots of P(x) = 0 can be found if one can determine the 
limits or bounds upon the roots themselves. By this, we mean, is it 
possible to say that all roots of P(x) = 0 are numerically smaller than 
some M, and M being a function of the coefficients of P(x)? A great deal 
of attention was directed to this kind of a mathematical problem during 
the early 1900' s and much of it is quite appropriate to this discussion. 
We cite several of these to illustrate this technique. 

If we refer to a theorem from complex variables we note that the 
region in which all roots of P(x) 0 lie is a circle with center at the 
origin and a radius of 

(U) 1 l^ki 

The |a^| max means to use the absolute value of the largest coefficient 
of P(x). For example all roots of (2) would be within 12 of the origin. 
This is true because the roots are -1, -2, and -3. For this example the 
radius is too large for it to be really effective or to be used as an ap- 
proximation for the largest root. However, it is very easy to compute. 

There are other estimates and approximations for the location of 
roots of P(x) - 0. We give three such expressions to illustrate further 
this kind of investigation into bounds on the roots. 

The expression in (U) can be replaced by other, yet similar, expres- 
sions giving the va3.ues of the radius of a circle containing all the 
roots of P(x) = 0. Walsh fl^] gave the following limits on roots. 

1, All roots of x^ + a^^x = 0 lie in or on the 

circle with center at origin and a radius of |an j. 
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2. All roots of x2 + a-j^x + a 2 ® 0 lie in or 
on a circle with radius of + Vflj. (Center is 



always at the origin.) 



3. All roots of x^ + a.x^ + a 2 X + a^ ■ 0 lie i 
or on a circle with radius ja^^l + ^Jj^j + • 



in 



For the example (2) this radius would be 



|6| + \/fl] + ^ 



or approximately 



6 + 3.3166 + 1.8171 = 11.1337 



which is some better, but not much. This sequence of values for the 
radius finally can be expressed as 



U. All roots of P(x) = x^ + a^x^^“^ + ... + = 0 

lie in or on a circle at the origin and with a radius of 



jail + ^1^2 1 ...■*■ V^l^-nl • 



Carmichael and Mason [^] proved that all roots of P(x) ■ 0 are in 
absolute value less than or equal to 



+ |ai| 



+ a 



i2T ••• l%il 



For example (2) this would be which is actually larger in value 

than the value of 12 which we obtained earlier. 



Williams [l8] later showed that the absolute value of any root of 
P(x) = 0 was less than 



v4 + ja]^ - l|^ + jag - + ... + |an - an-i)^ + 1%!^ 



Again for our example this would be )^12 , or better than the other esti* 
mates . 



From the literature on this subject one is notable to determine, 
except for special equations P(x) =* 0, when these values stand a chance 
of being a reasonably good approximation for the largest root of P(x) 

■ 0. There are several sources which give expressions for the least 
root of P(x) = 0 and we cite a few at this time. 
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Landau [lO] was able to show that every equation of the form ax^^ + x 
+1=0 has a root, the absolute value of which is not greater than 2, 

and that every equation of the form ax^ + bxF + x + 1 ■ 0 has a root, the 
absolute value of which is not greater than 8. These two observations 
were to establish a chain of discoveries and refinements which are excel- 
lent illustrations of the creative process in mathematics. 



Allardice [23 was able to generalize these results to obtain the 
following theorems: 



Theorem. The equation ax^ + x + 1 ■ 0 has a root, the absolute 
value of which is not greater than n/n-1, which is the value of equal 



roots. 



Theorem. The equation ax^ + bx?' + x + 1 always has a root whose 

n • m 



absolute value is not greater than 






Theorem. The equation ax’^ + bx™ + ox + . . . + a^^x + Oq “ 0 has a 
root whose absolute value is not greater than 



fi£ 

a- 



n 

n-1 



m 



Ji 



• • • 



n-± m-1 A -1 

regardless of the other coefficients of the equation. 

Applying Allardice *s approximation to example (2) we find this value 
18 



to be ^ , with |-ll less than this, a surprisingly good approximation. 



Landau’ s condition would have been a root whose absolute value was less 
than 8 and not a good approximation. 



About the same time Fejer [63 also generalized Landau’s results by a 
very elegant theorem. We state the theorem for algebraic polynomials 
with integral exponents although the theorem is more general than this 



case. 



Let 



Iq + a^^x + agX^ + ... + *” ® 



be an equation of n+1 terms, a.^ 0, a]^ 0. Let p be the root of this 

equation of least absolute value. Then 



p ^ n 



ao 



^1 
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1^0 

The value n hr* 
1^1 



gives both Landau's value for ax^ + x + 1 and 
Fejer's value for the same equations. For our example (2) we see 



P ^ 3 




IB 

11 



Carmichael and Mason Montel [ll] and others were able to pro- 

vide a slightly generalized statement on the last root. It is easy to 
intuitively verify these results for the quadratic equation and should 
make interesting enrichment material for some secondary school students. 



■i 

3. Some Approximation Theorems. I 



In this section we will note some theorems which have previously 
been proven and are related to our discussion. One should note that 
there are various and ^'iLverse techniques for isolated instances of root 
finding. In the following theorems we shall be concerned with the nth 
degree polynomial equation of the form 

(l) P(x) « x^ + ®1^*^ **■ ^ 2 ^“^ ••• %i " 0 



If P(x) * 0 has a numerically large real root it is possible to 
find an approximation for this root by solving the equation 



X - a-j^ = 0 

The reasoning for this is that the coefficient a-j^ in equation (l) is the 
sum of all roots of P(x) « 0. If a^. is this numerically large real root, 
then 

ai = + ci2 **■ + . . . + (Xjj 

where tt 2 >ct 3 , . . .,an ®^e the other roots of P(x) =* 0. a^ is approximately 
if is large in comparison to 0.2 Ct^ + ••• + On* However, this 

approximation idea has limited application because (X^ must be so large 
with respect to other roots. 

There is, however, a technique for finding this large root and it 
is stated in the following theorem. 

Theorem l .*^ If a-j^ is a numerically large root of P(x) ■ 0, where 



*The Theorem is said to be well-known by Oldenburger [12] and we give a 

similar proof. 
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P(x) * + a-LX^""^ + agX^"^ + ... + a^ 

Then a^ approximately satisfies the equation 

x^ + a-^x + ag ® 0 . 

Proof: Let a-j^ be the sum of the other roots a2>a~, . . .,ctn of (l). It 
follows that a-j^ = -(ci^ + rjj) • Let r^2 "^^e sum "^of the products of 

a2>ft3,...,Ojj in pairs (that is, ^2 “ ^ 2 ^^} ^ ^ 2 % '*’ ^ 3 % +...). From 

general knowledge about theory of equations the coefficient a2 can be 
written as 

&2 = rr, 2 ^a 2 + + ... + a^cf^ + + o^cti^ + + ... + otn-l®n* 

Hence, 

&2 = • (a2 + a3 + + ... + cr,n) + 0^2^^ ■*■ ■*■ ••• 

or 

&2 “ ^'l^x ^ ^2 

Since (X-]_au^ is large compared to ^2 (since (^2 does not contain a term with 
a-j_ in it), then we can say a2 is approximately a^a^. The equation 
y? + a-]_x + a2 ■ 0 can now be written as 

x2 - (a^^ + a^)x + a-j^cT]^ = 0 

and a-j^ is a root of this equation. 

There are equations for which y? + a^^x + a2 *“ 0 will not give a good 
approximation for o,^; in which case we increase the degree of the equa- 
tion by one to obtain 

y? + a-^^x^ + a2X + a^ ** 0 

A similar argument can be given to show that ctn , if large, satisfies this 
equation. One must also assume is the only '^large root, as one tacitly 
assumed in Theorem 1 . 

In fact, one may generalize this to produce the following theorem: 
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Theorem 2. If is a numerically large root of P(x) « 0, then ap- 
proximately satisfies the equation 

3^.n-l + + a2x’^"^ + ... + a^.i ■ 0 . 

The proof is similar to the proof of Theorem 1. 

A word of caution is necessary here. By a numerically largo root, 
we mean either positive or negative. The proof essentially rests upon 
absolute values and this is clear if we go back to equation (2) of 
section IV, 2. The equation x3 + 6x^ llx + 6 ■ 0 has as its largest 

root |-3|. If x^ + 6x + 11 ■ 0 is used as an approximating equation, 
the roots of it are 



o 

ERIC 

mmtim 



X 



-6 \/-8 ^ 
2 



The absolute value of x 



is 



-6 A >/— 8 




a reasonably good approximation 



for I -3 I considering that |-3| is not relatively large with respect to 
the other roots. If one takes the equation 



x^ - 9x^ - 12x + 20 * 0 

which has roots of 10, 1 and -2, the approximating equation becomes 

x2 - 9x - 12 - 0 

and its largest root is 2— or approximately 10.18, which is a 

very good approximation for the largest roots which was 10. 

We mentioned previously in section III the Weierstrass Approxima' 
tion Theorem where a function is approximated by a polynomial under a 
certain set of conditions. We restate the theorem for immediate 
reference. 

Weierstrass Approximation Theorem: If f(x) is a continuous func- 

tion on the closed interval [a,b], then for e > 0 there exists an 
n - n(e) and a polynomial Pji(3c) of degree n such that 

(2) |f(x) - Pn(x)l < € f or a ^ X ^ b . 

The polynomial Pn(x) and n are not unique. That is, one may find a 
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second polynomial of degree m which would satisfy (2) for the same e. 

Thus, polynomial approximations to f(x) exist which have a predetermined 
accuracy on Tajh^. The theorem says the polynomials exist, but does not 
give any clue as to how to find them* 

One class of functions f(x) which are continuous is the class of 
polynomials. Thus P(x), a polynomial of degree m , is a continuous 
function. In fact, it is continuous for every real number x, and there- 
fore, would be continuous for any closed interval [a,bj. The Weierstrass 
Approximation Theorem applied to this continuous function P„i(x) over some 
interval Fa,bT states there exists a polynomial of degree n which is 
within a certain e’s distance of P«(x). It is clear that one of the poly- 
nomials which approximates Pj„(x) is the polynomial P^Cx) itself. However, 
this is not necessarily the only one as we observed in the preceding 
paragraph. Immediately, two questions which are of importance to this 
study need to be answered. First, are there polynomials of degree n , 
where n 5 ! m, which approximate Fjf^vx) to a desired accuracy? Second, if 
there is a polynomial n ^ m which approximates Pj^(x) on some in- 

terval, do the roots of Pn(x) approximate the roots of P^Cx)? 

What we really would like to find is a polynomial Pn(x), n < m, 
which approximates P«(x) and where the coefficients of Pn(x) are obtained 
in some fashion from the coefficients of Pm(x). If the roots of P^Cx) 
could approximate the roots of whereever possible^ then we could 

solve for the roots of P^Cx) - 0 and obtain approximations for the roots 

of Pj„(x) - 0. 

In what follows we shall establish some conditions under which the 
answers to the two questions just raised are affirmative, and, in par- 
ticular, which allow us to do what we expressed in the last paragraph as 
a desire to be able to do. The follo\d.ng theorems, some of which were 
not found in the literature, will outline some of these conditions. We 
shall be much more specific in the section on quadratics. 

Let P(x) »* x^ + a-j^x?^“^ + & 2 X^"^ + ... + % " 0 with single roots. 

Let Q(x) be composed of all terms of P(x) except that one term, say 
1 ^ a ^ n-lj is different from the corresponding term a^x^-d 

by an arbitrarily small amount less than g over the interval [0,1]. Let 
0 be a lower bound and 1 be an upper bound on the roots of P(x) ■ 0. 

Then we have 

Theorem D3. If X 2 is a root of Q(x) « 0, then it is an approximation of 
a root X]_ of P(x) ■ 0. 

Since P(x) is continuous and Q(x) is arbitrarily close to P(x) over 
an interval [0,1] the Weierstrass Approximation Theorem holds. We know that 

jp^x) — Q(x) I * |b^x^"'3 — a^x^"^ I G for some fiiced 3 > 1 ^ 3 ^ n— 1. 
Suppose X 2 is a root of Q(x) » 0 and x^_ is the corresponding root of 
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P(x) ■ 0. We know that 

|p(x3^) - + p(x2) - QUa)! ^ |P(=^2) - Q(^<^2)| 

< € + € “ 2c 

But QCxg) ■ 0 and P(x-j^) ■ 0 so 

Ip(x 2) - Q(x^)| < 2c 

This can be written as 



|ajx|-j - bjxJ-j) < 2e 



Since positive, and we lose no generality assuming 

a j > b j , then we have 




Therefore 



|aj| • |x5“^ - 3^“^ I < 2c 



or 




but 




We have 




or 




where 



|m| - |x5“j“^ + ^ ••• ”*■ 










r 






Since jajj and |m| are finite, we have 

1*2 - *ll *= ®1 

Thus X 2 and are arbitrarily close, depending upon the closeness of 

P(x) and Q(x). Hence, a root of Q(x) ■ 0 is an approximation to a root 
of P(x) ■ 0. 

This theorem is intuitively obvious if one constructs a graph of 
P(x) and looks at where Q(x) must go. 




Since Q(x) must lie between P(x) + e and P(x) - e it must also cross the 
x-axis in some small interval about x-^. Hence an approximation for x-j_ is 
obtained at X 2 « 

It is also intuitively obvious that not all roots of Pjj(x) ■ 0 can 
be roots of any polynomial of degree n-1. The following theorems show 
this to be true and under what circumstances. 

Lema Dl. If bQ^,b 2 ,b^, . . . ,bj^ are the roots of the nth degree polynomial 
equation 

P(x) = x^ + + a 2 X^“^ + ... + a^ = 0 
and if ajj 0, then 0, i = l,2,...,n. 



f 

i 
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Proof* Let be a root of P(x) ® 0. Since iK 0, then bj^ 0. For 
if bi ■ 0, then 

(bi)" + a^Cbi)""^ + ... + a„ - 0 

and every tenn containing a b^^ is also zero and this would mean a^j would 
also have to be zero for b^^ to be a root of P(x) ® 0. Since a^ ^ 0, then 
bi ^ 0. 

Theorem DU. Let b]^,b 2 ,bo, . . .,b„ be the roots of the nth degree polyno- 
mial equation 

P(x) - x^ + + a 2 x’^"^ + + a„ ® 0 

Further, let a^ be non-zero. Then no root of P(x) - 0 can be a root of 
Q(x) ■ 0, where Q(x) is obtained from P(x) by deleting the x'' term. 

Proof: Assume bj also is a root of Q(x) ■ 0. Then Q(bi) ■ 0 and 

we already know P(bj^) ■ Oj but P(x) ■ x^ + Q(x) for all x , and 
therefore, 

P(bi) - (bi)'' + Q(bi) 



or 

0 ® (bi)'' + 0 

The only way this last equation can hold is for ■ 0. But by the lemma 
bi»'0, therefore^ bi cannot be a root of Q(x) ■ 0. 

Theorem D^. Let bi, (i * l,...,n) be the roots of P(x) - 0. Let Q(x) 

- P(x) - a^xF'"j, j ® 1,2,..., (n-1), a^ 0, a^ 0. Then no root of 

P(x) ■ 0 can be a root of Q(x) ® 0. 

Proof: Since a^ 0, then b^ / 0, Assume bi is a root of Q(x) ■ 0. 

• ■ 

Then we have Q(bi) ■ P(bi) - aj(bi)*^ ^ or 0 ■ 0 - aj(bi)'' Since 
aj 0 and (bj^)""^ / O^then b^ cannot be a root of Q(x) ® 0. 

In both of these theorems the condition a^^ ^ 0 holds and the theo- 
rems hold for large collections of polynomial equations, but not all. 

For example, if P(x) - x^ - x^ + x^ - x » 0, a root of this equation 
satisfies the equation 

Q(x) = -x3 + x^ - X ® 0 



or 
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Q(x) » - X » 0 . 

It is clear that the root which does this is the zero root. We will now 
examine the cases in which a^^ ■ 0 and see if we can enlarge the collec- 
tion of polynomial equations which satisfy Theorems DU and If ■ 0 
then the polynomial equation P(x) ■ 0 can be written in the following 
manner: 

P(x) « x^ • R(x) where k - l,2,...,n • 

If a.jy is the only zero constant of P(x) then k ■ 1. R(x) is a polynomial 

of degree n-k with the constant sin-k being nonzero. This expression for 

P(x) is obtained for any P(x) with a^j ■ 0 by factoring the highest power 

of X from P(x) and R(x) is the remaining factor. One may deduce two 
interesting relationships between the roots of P(x) ■ 0 and R(x) *0. 

Theorem D6. If a^j ■ 0, b-j^ ■ 0 and P(bQ^) ■ 0, then R(b-|^) / 0. 



Proof: Let 

P(x) ■ x^ • R(x) 



or 



where 



P(x) 



X' 






n-k 






%1-k) 



^n-k ^ • 

(If aji-k were equal to zero, an additional factor of x could be removed 
from R(x) and some other nonzero coefficient would be the constant term 
of a new R(x).) Then consider 

R(bQ_) ■ b^ ^ + aQ^(b2^) + ••• + %i-k • 

Since b-j^ = 0 we have 

R(bi) * R(0) ■ ^ ^ + ... + 



and 

R(0) “ ajj«k • 

Hence, the zero root of P(x) - 0 is not a root of R(x) - 0. 

Theorem D7. If ajj « 0, and b^ is a nonzero root of P(x) = 0, then R(bj^) 

« 0 . 
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Proof: Let P(x) " x*' • R(x). If bi j< 0 and is a root of P(x) - 0, we 

^ have 

P(bi) - 0 - (bi)*' . H(bi) . 

Since bj^ ft 0, (bi)** / 0 and consequently, R(bj^) must be zero. Hence b^ 

is a root of H(x) ■ 0. These two theorems together say that the nonzero 
roots of P(x) " 0 (where a^ ■ 0) are the nonzero roots of R(x) ■ 0, and 
any zero roots of P(x) ■ 0 are not roots of R(x) " 0, Lemma 1 further 
says that R(x) ■ 0 has no zero roots. We shall now reconsider theorems 
DU and DS in a broader sense. 

Theorem D8. Let be roots of the nth degree polynomial equa- 

tion 

P(x) ■ x^ + a-^x*^"^ + a 2 x’^“^ + ... + a^ ■ 0 

Then no nonzero root of P(x) ■ 0 can be a root of Q(x) - 0 where Q(x) is 
obtained from P(x) by deleting the x^ term. 

Proof! If an 0, then DU holds. If an « 0, then P(x) ■ 0 takes on the 
form 

P(x) ■ x^R(x) ■ + a-j^x”"^"^ + ... + an-k) “ 0 



Then Q(x) ■ P(x) - x". If b^ 0, then 

Q(bi) - P(bi) - b5 . 

Since, by hypotheses P(bi) * 0 and (bj^)”V 0, we have 

Q(bj) « (b^)*' 0 

and bj^ is not a root of Q(x) ■ 0. 

Theorem D9. Let b^, i - l,2,...,n be the roots of P(x) ■ 0 (where P(x) 
is the polynomial of Theorem D8). Let Q(x) ■ P(x) - ajX^"J where aj 0. 

Then no nonzero root of P(x) ® 0 can be a root of Q(x) ■ 0. 



Proof: If an 0, then Theorem D^ holds. If an ■ 0, then we can write 

P(x) « 0 as x^ • R(x) ■ 0 as we did in Theorem D8. Then Q(x) 

- x^ • R(x) - ajX?^“^. If bi 0, then we have 

Q(bi) = (bi)*' . R(x) - aj(bi)""^ 

but, by Theorem D7, if P(bj^) « 0, P(x) * x^ • R(x), and bj^ / 0, then 
, R(bj^) ■ 0, and 
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or 



Q(bi) - bi • 0 - aj(bi)”"^ 



Q(b^) « -aj(b^)”"^ 0 . 

Hence b^ cannot be a root of Q(x) - 0. The sequence of theorems have 

shown that a nonzero root of a polynomial equation P(x) ■ 0 cannot be a 
root of a polynomial equation obtained by deleting some nonzero term of 
the original polynomial equation. 



If we examine Theorems D3 sind D9 together, we may observe one vei*y 
interesting possibility. Theorem D9 says no nonzero root of P(x) * 0 

can be a root of Q(x) « P(x) - a^x’^’j - 0. Theorem D3 says if P(x) and 
Q(x) are alike except for some term a^x^"^ and these terms are arbi- 
trarily close, then a root of Q(x) - 0 is an approximation to a root of 
F(x) * 0. The roots cannot be the same but can be close to being the same. 
Theorem D3 can be used on the P(x) and Q(x) of Theorem D9 if s-j is rela- 
tively small with respect to the other coefficients and Q(x) then can be 

taken to have no term in it. In this case we have the following 

theorem. 

Theorem DIO. If P(x) * 0 has a term ajX?^"j with a^ relatively small ^th 
respect to the other coefficients of P(x), then "^the polynomial Q(x) 

P(x) - ■* 0 has a root which approximates a root of P(x) ■ 0. 



If ajX^“j is relatively small on some interval then, by the Weier- 

strass Approximation Theorem, Q(x) is an approximation for P(x) over this 
interval. Hence, by Theorem, D3 Q(x) * 0 has a root which is an approxi- 
mation for a root of P(x) = 0, and the theorem is proved. 



In the section on the quadratic equation we shall see the implica 
bions of this theorem in application. 



V. Some Approximation Techniques for Quadratic Equations . 

1. Introduction. We have already presented some general theorems on 
approximations of polynomials of nth degree. Now we will direct our 
attention to the quadi-atic equation and the approximating equations for 
the quadratic. The quadratic is especially important because it is the 
most elementary polynomial, besides the linear one, which is found in the 
algebra taught at the secondary school level. Moreover, we can be more 
specific for this polynomial in several ways and this makes the approxi- 
mations more valuable. 



We will not consider the case where we approximate the roots of a 

Uo 












quadratic by the roots of another quadratic. Since we would have to 
solve the second quadratic anyway, the effort involved might just as 
well be directed toward solving the first quadratic. It is also true 
that, for elementary problems, this is the least interesting case. We 
will consider approximating the roots of a quadratic by deleting a term 
of the quadratic* and solving the resulting equation for its roots. This 
will be done by considering the various possible types of roots of the 
quadratic equation and the various terms which can be deleted for each 
type of root considered. We will be concerned with the deletion of only 
one term from the quadratic, and there will be a total of six cases to 
be considered. 

2 . The Quadratic Equation with Real .Roots. Write the quadratic equation 
P(x) = 0 in the form 

P(x) « x^ - (a+b)x + ab ® 0 

where a and b are the real roots of P(x) « 0 and a b. We now con- 
struct Q(x) - 0 from P(x) - 0 by deleting a term of P(x) « 0. 

Case A. The first term we shall delete is x^. Let 

Q(x) = P(x) - x^ * 0 . 



Then 



Q(x) = -(a+b)x + ab = 0 . 



We wish to know how the root of Q(x) « 0 is related to either root of 
P(x) = 0. Solve Q(x) * 0 for x . Since 

Q(x) = -(a+b)x + ab = 0 



we have 



X ** 



ab 

a+b 



Note that the expression ^ is a symmetric one in a and b, so 
whatever we might say relative to a could be said relative to b . 

Since a b, let a be greater than b . If a is large relative to b 

the expression becomes relatively close to b , i.e., is an ap- 
proximation for b if a is large relative to b . But this means that 
approximates a root of P(x) ■ 0 under this set of conditions. The 




related question of just how good an approximation one has can be deter 
mined by looking at the follovdng table. 



Table I - Values of ^ 







a « 8b 


a ° 20b 


a « 100b 


a 


a - 2b 


a - Ub 


b 


b 


b 


b 


b 


b 


ab 

a+b 


2 . 




8 *, 
9 


20 n 


100 , 
1(51’’ 











As one observes in the table the expression ^ takes on values whxch get 

closer and closer to b as a becomes larger (relative to b) . In fact, 

the limit of is b , as a increases without bound. Moreover, from 

the conditions described for this particular case, the approximation ob- 
tained is always for the smallest of the real roots. 



If 

la| > b, 
below. 



is sufficiently large in the negative direction, 
he same sort of approximation is obtained. Examine the table 



Table II - More Values of 



a “ 


-2b 


-lib 


•^b 


-iOb 


-100b 


b = 


b 


b 


b 


b 


b 


ab ^ 
a+b 


+2b 


li 

3 


■ 8b 
7 


10b 

“9 


100b 

99 



Thus is an approximation for b if ]a| is greater than b . The 
a+b , 

closeness of the approximation, for a given b , depends on y on e 

value of a . 



If one takes the values found in Tables I ^ and puts this^^^ 
praohical form one obtains the eqviilateral hyperbola for a g^^P"* ^ 
-AdS one fixes b at sar.e value. Table III is one such graph for a 
fixed b . The graph reflects the two observations made about 

■— being an approximation for b , provided |a| is relatively large. 

oL** D 



Or. 



e should also note that ^ does not approximate b in the central 



region of the graph where a is close to -b. In the event a is close 
to -b in value we observe (a*b) is very small and the =?f 
is small with respect to the other terns of the quadratic. This gi 
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Table III - Graph of , 
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clu as to wh<-'n wo can 
o.'*‘ P(x) " 0. We 



■lepend upon being a f;ood approximation 
!onrio.lize this in the following theorovi. 



Theorem i)ll. If P(x) = x2(oi-b)x 
P(xj =0. a 7 ^ 0, b 0 and |c.| > b, 

Q(x) = P(x) - 



ab « 0, with a,b real roots 
then the equation 

x2 «= -(a+b)x + ab = 0 



of 



ab 

nas a rc >t, x = which is an approximation for the smallest root of 



P(x) "• 
son to 



the coefficient of x^ is relatively small in 
t.'.e coefficient of x and the constant term a*b. 



compari- 



Gornment J. . Note that if either root is zero, we have a 
tic v;hicii need not be solved by approximation. 



special quadra- 



Comment 2, Note that the key as to when one can depend upon to be an 

a+b 

approximation of a root of P(x) = 0 is the coefficients of the polynomial 
P(x,. Ive have written P(x) - 0 in the form x^ - (a+b)x + ab = 0 so as to 
reduce the number of unknown coefficients. If the coefficient of x2, 

i.e., 1, is small relative to j-''^a+b)| and |abj then provides an ap- 
proximation to a root of P(x) « 0. 



Comment 3. The theorem does not say so, but since ja| > b, the root ap- 
proximated is, numerically, the smallest. 



Comr.ient I|., The error ol approximation is likewise not given in the theo- 
rem, but it is easily determined by computing the followings 



Error 



b 



ab 

a+b 






However, this requires a knowledge of what b is and if we must know b 
in order to compute the error then we already know a root of P(x) » 0 and 
all this approximation business is needless. However, the Tables I and 
II suggest a means to determine relative error and Table III gives enough 
of a graphic picture to approximate the relative error. Error and rela- 
tive error are strictly functions of two variables, a and b , and, 
therefore, for any fixed equation P(x) =* 0 the amount of error is pre- 
viously fixed by the roots of the equation. 



Corument This theorem justifies the procedure used in the introduction 
to solve a certain well proolem and puts the procedure on a sound mathe- 
matical basis. 



Case B. Consider how Q(x) = P(x) - (a+b)x =* 0; that is, take P(x) « 0 
and delete the x term. We are still assuming a and b are real 
roots of P(x) and a ^ b. Then 



Q(x) = x^ + ab « 0 

hh 



and 



y? a -ab 



for y? ■ -ab to have real roots, either a or b must be negative. 

Assume a is. Then b is positive and -ab is positive. Hence 
X « ± ^-ab and the latter is a real number. But v-ab can be ai' appro^- 
mation for b only when a « -b. In either case the expression x « V-ab 
can approximate a root of P(x) « 0 only if a is approximately equal to 
-b. Vflien this happens the coefficient of x , i.e., -(a+b) becomes very 
small and approximately zero. Thus the term being deleted from PCx) ■ 0 
has a relatively small coefficient in comparison to the other coeffi- 
cients in the equation. 

For example, the equation 

y? + .Olx ■ (12)(11.99) ■ 0 

has a root which can be approximated by a root of x^ - (l2)(ll.99) ® 

i.e., X ■ ;t\/l2(ll .9^) 

In fact, we have approximations for both roots. These observations can 
be stated in the following theorem. 

Theorem D12. Let P(x) * - (a+b)x + ab « 0, with a and b real, .. 

a 0, b 0. Let j-(a+b)l be relatively small as compared to 1 and 
|ab|. Then, both roots of P(x) « 0 are approximated by the roots of 
Q(x) =» x2 + ab = 0. 

Comment 1. If -(a+b) is approximately zero, then a is approximately 
the negative, of b , Thus a and b must be real roots, otherwise ab 
would be complex or numerically smaller than a+b and neither are to be 
allowed in the theorem. 

Case C. Now vie consider the case where Q(x) =* P(x) - ab - 0 or 
Q(x) = x^ - (a+b)x * 0. We still assume a and b are real roots of 
P(x) = 0, a b. Solving Q(x) ■ x^ - (a+b)x * 0, we obtain 

X * 0 and x “ (a+b) . 

If ab is to be relatively small with respect to 1 and |-(a+b)|, then 
either a is close to zero or b is close to zero. If a is approxi- 
mately zero then |-(a+b)| is approximately b . In this case the two 
roots of Q(x) = 0 give approximations for the two roots of P(x) * 0. 
Similarly, if b is approximately zero, |-(a+b)| is approximately a 
and again we have two approximations for the roots of P(x) ■ 0. 

We have, therefore, the following theorem. 

Theorem D13. Let P(x) - x^ - (a+b)x + ab « 0, with a and b real. 
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a 0, b 0, a > b. Let |abl be relatively small as compared to 1 and 
l-(a+b)l/ Then the two roots of P(x) = 0 are approximated by the roots of 

Q(x) = x2 - (a+b)x « 0. 

3. The Quadratic with Complex Roots. We consider now the cases where the 
roots are complex. Suppose 

x^ - a^_x + a2 - 0 

has complex roots. Let a+bi and a-bi be these roots. ^ [We know 

roots of P(x) « 0 occur in pairs and are con, jugate pairaj Thus the quad 

ratio becomes 

% 

x^ - (2a)x + (a^+b^) * 0 

Now consider the various equations obtained from (l) by the deletion of 
a term. 

Case A. We begin by considering the deletion of x^. Then 

Q(x) * -2ax + (a^+b^) = 0 . 



Solving for x we have 



a2+b2 

"Ta^ 



or 



l 



a b^ 

2i 



If a is relatively large and b is relatively small, th^ value 
for X becomes approximately a/2. If b is not small, then b /2a 
becomes a significant part of the value for x and x may 
ger than a/2. But if b is small in comparison to a ^^^^en the loot of 
Q(x) = 0 enables us to find an approximation to a root of P(x; * 0. ihis 
seems to be a contradiction for x is a real .n^ber from Q(x) * 0 ^nd 
the roots of P(x) are complex. This contradiction is no longer a con r 
diction if one locates these values in the complex plane. The roots ol 
P(x) = 0 and Q(x) - 0 are put on the graph as follows: 
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Since, b is relatively small the roots of P(x) ■ 0 lie close to the 
x-axis. The root o*' Q(x) ■ 0 is a/2 and it does not approximate either 
root of P(x) » 0, but a does'. Thus, for this case, Q(x) ■ 0 gives a 
value which can be multiplied by 2 to get an approximation of either root 
of P(x) ® 0. Therefore, we may state the following theorem. 

Theorem Dili. If P(x) •* x? - a^^x + ap ■ 0 has complex roots a bi, and 
a is relatively large and b is relatively small, then the root of 
Q(x.) « -a^iX + a2 ■ 0 (or Q(x) P(x) - • 0), when doubled, becomes an 

approximation for the roots of P(x) *0. 

Case B, Instead, of deleting from P(x) * 0, consider the Q(x) obtained 
by deleting the x term. We have 

Q(x) ** x^ + (a^+b^) = 0 

Solving for x we obtain 

X « ± \/(a^+b^) 



or 



X 



* i 




If a is relativoj.y small, then the value of (a^+b^) is approximately 
b2, ar.u thus x is approximately i*b. Hence, if a is sinall, the 
solutions of Q(x) - 0 are approximations to the solutions of P(x) ■ 0. 
Again this becomes obvious if we locate these values on the complex plane. 




Thus wo have the following theorem. 
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Theorem Dl^. If P(x) « - a-j^x + a 2 » 0 has complex roots a ± bi, and 

a is relatively small and b is relatively large, then the roots of 

Q(x) - x^ + (a^+b^) - 0 (where ag « a^+b^), then the roots of Q(x) - 0 
are approximations for the roots of P(x) - 0. 

Case C. If we delete the constant term we know that Q(x) ■ 0 is defined 
to be P(x) - ag = 0 or 



Q(x) ■ x^ - 2ax ■ 0 . 

The roots of Q(x) - 0 are x « 0 and x » 2a. If the roots of P(x) - 0 are 
a J: bi, then the only way for the roots of (x) to approximate the roots 
of P(x) = 0 are for the roots to be close to zero or close to a . In 
the first instance, a and b would both be close to zero and so would 
a bi. Thus both roots of (U) are approximations to the roots of P(x) 
=0. In the second instance, if b is relatively small then 1/2 of the 
root X ■ 2a will give an approximation to a i bi. 

From this analysis we can state the following theorems: 

Theorem Dl6(a), If P(x) ■ x^ - a^^x + ag « 0 has complex roots a * bi and 

a is relatively large and b is relatively small, then the nonzero root 
of Q(x) ■ x^ - a-j^x * 0 multiplied by 1/2 is an approximation for either 
root of P(x) * 0, 

Comment: The roots of P(x) - 0 lie close to the x-axis on a graph and 

thus a is an approximation for either root. Notice the similarity of 
this case to Case A considered under the section on complex roots, 

We also have one other theorem we may state at this time. 

Theorem Dl6(b)., If a is also relatively small in Theorem Dl6(a), then 
both roots of Q(x) « 0 are approximations to the roots of P(x) = 0. 

This must be true for if both a and b are very small, then a * bi 
both lie close to the origin. Hence either 0 or 2a will be approxima- 
tions for a db bi. 

It is interesliing to observe that in all cases of complex roots the 
amount of error of an approximation to a root of P(x) = 0 obtained from a 
Q(x) =» 0 depends almost entirely upon just one part of, the complex root, 
that is, either the real part or the imaginary part. The graphs asso- 
ciated with the various theorems of this section show this rather clearly. 

U. Comments and Observations, In every case considered we were able to 
deduce conditions under which Q(x) ■* 0 produced roots which were approxi- 
mations to P(x) ■* 0, In each case the condition contained some stipula- 
tion upon the roots in order to insure that approximations were possible. 
If we do not know what the roots are, then we cannot insure whether the 



1 

I 

I 



theorem applies or not. We could make some In' 

ceed on the basis of the the 

six cases we have considered, and no 

This means we can use each theorem under just 

others . 

Take an example. Suppose P(x) - f - °^?hout) 

6 theorems we ®(*^^°o^^rrDll"^f DIU^ two a?e the only 

sris ssr:s fJSi J r A - ”T?rs “Ssi jut- 

cient. In both theorems we used Q(x) = -7* + 9 - 0 to approxxmaue 
root of P(x) - 0. Thus x - ^ is an apprcjximation to the smallest root 
of P(x) = 0 regardless of whether P(x) » 0 has real or complex rootsl 

If P (v^ ” 0 has complex roots, we know that + b 9 must hold 

and b = . Put for Theorem Dllt to apply b must be relatively 

small with relpect to a and this is not true in this case. Hence, Dl!+ 
is out, leaving only Dll. 

If P(x) ® 0 has real roots (and it has) then -(a+b) *^-7 and one 

root is approximately | so the other must be +7 - | °r + Y’ 
observe that we were justified in using theorem Dll. 

If one really wished to insure the right selection, check the coef- 

• +e n-p pfv'i ■ 0 and compute the discriminant. Not only does the 
ficients of P(x) - 0 ^3^1 or complex, but the size 

S^th^Ssfri^inait^hS it is neptive determines when the complex roots 

may lie close to the real or imaginary axis. 

It turns out then that we do not really need to taow the r^ts to 

S“?( .T„-!UUSS 8S"TTiU UrU £ Sf ipS. 

eLffioients S P(x) = 0 are specific functions of the roots and, hence, 
Telfus aS Lch ai the rootedo if we but know how to read them. 

One last observation which can be made about quadratic has to do 
with two quadratic functions which differ by a small amount. 

Let P(x) * x2 - (a+b)x + ab, where a and b are roots of 
To+ ofv'i - P(x) - e where Q(x) is a quadratic which differs from 

;; - 

case we have 



Q(x) * x^ - (a+b)x + ab - 6 



0 



Solving by quadratic formula we have 






2 



( 1 ) 



X 



(a+b) + (a-b) 
2 



+ \/e ** a + 



and 



( 2 ) 



X ^ 

2 



(a+b) - (a-b) 



\/e « b - \/e 



Thus the roots of Q(x) * 0 differ from the roots of P(x) ■ 0 by at 

most v'c. When Q(x) differs from P(x) by exactly e. A similar statement 
can be made regarding a pair of cubic functions which differ by exactly 
e , except the roots of one cannot differ from the roots of the other by 



the cubic P(x). It is probably true that a corresponding statement can 
be made for the nth degree polynomial, 

VI . Approximations and Other Equations . 

On the basis of what we have already seen, one would hope that the 
concepts of sections IV and V could be extended to other polynomial equa- 
tions of higher degree. The problem of such an extension is that we can- 
not always solve an nth degree equation and, therefore, don’t know if 
approximations to the roots of one equation can be actually computed from 
a derived equation. The Weierstrass Approximation Theorem and Theorem 
DIO state that approximations do exist but finding them is another matter. 
Even the cubic equation presents some serious problems as far as applying 
the concepts of section V. We shall examine this one very briefly. 

Suppose P(x) = x3 + a^x^ + a2X + » 0, If we delete certain terms 

one by one we have the following derived equations; 



more 



than K • ^ , where K is a specific function of the coefficients of 
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0 



Qj(x) - x3 + aj^x2 + “ 



and 



Qj^(x) = + a2X » 0 



Equations QpCx) » 0 and Q 3 (x) ■ 0 are both cubics which would ha e 
to be solved to obtain solutions which approximated solutions ror 
P(x) - 0. There is really no advantage in solving these as precisely 
the same kind of effort would have to be exerted as would have been 
necessary to solve the original equation, which is 

might just as well solve P(x) = 0 in these oases and obtain the actual 
roots. 

In the case of Q).(x) we are assuming one root of P(x) - 0 is approxi- 
mately zero and the other roots are readily obtained ^ " 

ratic equation. One could, if the theorems 

of the quadratic via the theorems of section V but this would be finding 
approximations to approxdjnations to the actual roots and this could b 
unsatisfactory as a process. 

The only derived equation which shows any promise is Q;j_(x) ■ 0, 
since it is a quadratic derived from P(x) * 0 on the condition that 
ai, ag and are all relative?i.y large with respect to the coefficient 

of x^, which is 1. Without going into all the possibilities (which are 
really beyond the level intended in this study) we will cite some examples 
to show how the- approximations turn out. 



Example 1. If P(x) * x3 - 9x^ - I2x + 
+ 20 ■ 0 and we have 



20 



0, then Qj^(x) = -9x2 _ i2x 



-(- 12 ) ± 



\Z-12)^ 



lt(-9)(20) 



,2(-9) 



o 







12 * v®; 

V B ~ 

-18 

Taking \/35II to be 29.39 we find x to be approximately ®ad 

^7.39 _ three roots of P(x) = 0 are 10, -2 and 1, Thus the roots of 

Qj^(x) give approximations to the two smaller roots of P(x) = 0. The 
third root of P(x) « 0 can be approximated by the expression 

51 







This is true because 20 is the ne^;ative of the product of the three roots 

libcample 2, If P(x) = - lOx^ - 6x + 200 « 0 then Q-jix) » -IQx^ - 6x 

+ 200 » 0 and 

6 v®536 

^ l20 • 



Taking \/B03S to be 89.6 we find 
P(x) ® 0 are 7 rf- i 
of P(x) « 0. 



X to be -U.7'3 and U.lB. The roots of 
root of Qi(x) approximates the real root 



Example 3- 

+ 20 == 0 . 



If P(x) 
But 



« x3 + 13x^ + 32 x + 20 « 0 then Qx(x) 



X “ 



“32 \/“l6 

'2S 



13x^ + 32 x 



and P(x) = 0 has only real roots of -10, -2 and -1. If we consider 




“32 

26 



we find 




31.7 



and this is numerically an approximation for the smallest root. 



These examples are not intended to take the place of the theory 
which could be developed, but merely serve to show that, in one case, the 
concepts of section V on the quadratic are extendable, with modifications 
and additional conditions, to the cubic equation. Since the solutions of 
the general cubic equation are not a part of the algebraic materials ^ at 
the secondary school level, there seems to be little need in developing 
the approximation theory for this presentation. 



It should be noted that approximations to solutions of an nth degree 
equation can be obtained by methods such as Horner’s method or others of 
a similar nature. These methods are adequately described in any of the 
books on theory of equations (for example see Dickson, Theory of Equations). 



VII. Implications for Secondary School Mathematics . 

In this last section we wish to relate the previous sections to the 
business of teaching secondary school mathematics and indicate some im- 
plications for the future. We do this in the context of answering the 
original basic question which motivated this entire investigation. 

We wished, if possible, to determine if there was a systematic body 



of mathematical knowledge which ^ 

root to a polynomial by finding roots of a derived 

were possible to present this knowledge in some convenient fom 

a nropram of instruction could be established in courses in elements^ 

algebra at the secondary school level. We believe the elaborate 

two part question is in the affirmative for both parts and we elaborate 

by giving our reasons for this belief. 

In reviewing section III of this Appendix, we note the four types 
of approximation! listed include three for which an extensive amount of 
structAired mathematics exists. The three are 

1, Approximating a number by another number 

2 , Approximating a root of an equation 

3, Approximating a function by another function 

Of these three, the first two are quite old techniques, which we hope 
t he hi!tori!!^material of section II amply indicates, f®. techniques 
Svolted in these two range from simple arithmetic computations to such 
complicated computations as finding Sterling's formula or o • 

similar nature from the theory of numbers area in mathematics. Approxi- 
mSSrr!!tr!f an equation can be as simple as passing or ®^ 

ted as using recursive formulas depending upon the derivative of the 
o!lv!!mSl equations. In either case, these procedures are well pundp 
Lnd^re an integral part of the mathematpal stpcture f 

occur at various levels within the structurp In fact.tte first educa 
tional implication might very well be stated at this point. 

In the process of modernizing coll ege ^d university level matop 
rm+TPo we have for the most part, discontinued several courses as a 
main po-tion of the mathematics required of those wishing to be matpma- 
SSa!!'(a!d 1 include teachers in this group). Of particular m- 
tanoe to this study and most closely relatp of all j. 

course offered by the title of Theory of Equations. This , 

has nearly disappeared from the college catalog. There are ^ ^ ^ 

reasonffor this. Much of the content of such courses apparently has 
"inbsorLd L;o other courses, most notably abstract algebra 
However if one examines these other courses one finds the so-called 
Semr.^Iry concepts have frequently been excluded because they are too 

One may find some of these elementary concepts in a College 
Alrebra course, but not all. With more and more students coming to 
college with College Algebra as a background in mathematics, it seems 
that the algebraic content of the secondary school must include, in a 
sifmificant ’way, the basic concepts from the theory ol equations. 
Sf’eSe is Le phrase, "in a significant way". This would mean that 

techniques for finding approximations to roots of°secon- 

Tiorner's method, would be an appropriate part of the content ol secon 
!larr"choo^ It would seem that this would be an essential 

»)art of ariv accelerated orograni a,t that level, ihe mathematics is not 
C difficult for the secondary school. We need fi"e proper 

hind of presentation to make such a program not only feasible, but prac 
tiefa in impLmentation. In fact, the major accomplishrrent in mathema- 
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tics education for the next generation might very well be the creating of 
elegant but simple ways to present the more complex mathematics at an 
earlier stage of instruction than at present. This would be only one of 
many possible areas in which this could be done. 

The third type of approximation referred to in section III, approxi- 
mating a function by another function, is also a part of an existing 
mathematical structure. In fact, it is a foundation principle for the 
area in analysis which has made possible much of our current mathematics, 
particularly the applied mathematics . Much of this mathematics depends 
upon the calculus and, therefore, is not adaptable to the secondary school 
curriculum. Moreover, there appears to be no mathematical reason or need 
to suggest the necessity for such an adaptation. 



With respect to the fourth type of approximation given in section 
III, it is possible to make more positive statements of implication for 
secondary school mathematics. This type of approximation, the approxi- 
mating of roots of one function by roots of another related function, was 
examined in some detail with vo,rious relations spelled out in the theo- 
rems of sections IV and V. The following observations seem to be perti- 
nent to these sections. 



f 1. There is a mathematical structure available for this type of 

approximation technique. It is not as clearly delineated in the litera- 
I ture as the other types and, for the purposes of this presentation, seve- 

ral gaps were found. 

2. It is possible to establish certain theorems, based essentially 
upon concepts from the theory of equations and the elementary algebra 
which show the connections necessary to fill the gaps noted, 

3. The essential features of the mathematical structure are noted in 

I ! these two sections, first in a general way and then specifically for the 

quadratic equation. This expository presentation xiias not intended to be 
I complete in every detail and perhaps there is a more elegant presentation, 

I but the features presented show that it is possible to put some systema- 

tic order to this type of approximation. 

h. Most of the concepts presented are dependent upon very elemen- 
I f tary mathematics of the secondary school level and, hence, are adaptable 

■ to this level. By implication, this means much of the material presented 
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herein can be made a part of mathematics instruction at the secondary 
level. The material presented in this Appendix is intended for the high 
school teacher who will have to make the necessary adaptation to the 
cl.assroom situation. There does not appear to be any source available 
which has done this for the teacher. 

The fact that it appears to be feasible to adapt these concepts 
to secondary school mathematics and the fact tha,t there appears to be at 
least a suggested need for this material seems to imply that such an 
adaptation should be attempted. The basic suggestion here is that a por- 
tion of time and effort be directed to making the high school student's 
mathematics more relevant to his science instruction. (The same impli- 
cation could be made relative to the social sciences, but from other 
facts not necessarily related to approximation theory.) 
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6. Tbe presentation of this approximation theory should probably 
be more intuitive than rigorous. This is essentially a psychological 
question or observation and we have ample evidence from 
the School Mathematics Study Group that such an 

;ind that it actually works in the classroom on concepts in mathematics . 

In summary, there is a significant area of mathematics in apprOTi- 
mation theory which is elementary enough to be adaptable to second ry 

matSatics. It is mathematically possible to m^e --oh “ adap- 
tation, and, if properly presented, this adaptation could make a oontri- 
bution to mathematics and to science instruction at that level. 
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